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Related Applications 

This application claims the benefit of U.S. Provisional Application No. 
60/172,863, filed 20 December 1999, which is incorporated herein by reference in its 
entirety. 

Statement of Federal Support 

This invention was made with United States Government support under grant 
number AI-3 3363 from the National Institutes of Health. The United States 
Government has certain rights to this invention. 

Field of The Invention 

This invention relates to novel compounds that recognize mixed sequences 
{i.e., GC as well as AT base pairs), and specifically bind the DNA minor groove 
through dimer formation. 



Background of the Invention 

Design and discovery of molecules that can regulate gene expression in cells 
in a desirable and predictable mariner is a central goal of research at the interface of 
chemistry and biology. See, e.g., Schreiber, S. L„ Bioorg. Med. Chem. 6, 1 127-1 152 
20 (1998); C. Denison and T. Kodadek, Chem. Biol. 5, R129-R145 (1998); A. G. 
Papavassiliou, Molecular Medicine Today 358-366 (1998); R. E. Bremer, et al., 
Chem. Biol. 5, 1 19-133 (1998); J. Gottesfeld et al., Nature 387, 202-205 (1997); H. 
Iida„ Current Opinion Biotechnology 10 ; 29-33 (1999). The developing field of 
"chemical genetics" requires molecules that have the necessary selectivity to 
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recognize target genes. See. e.g., S. Schreiber, supra, and Schreiber, S., FASEB J. 
ll,p.Ml (1997). 

A number of aromatic diamidines have been shown to bind to the minor- 
groove of DNA, and to exhibit useful antimicrobial activity. Various hypotheses of 
5 the mode of antimicrobial action of the aryl amidines have been proposed. However, 
evidence is growing that these compounds function by complex formation with DNA 
and subsequent selective inhibition of DNA dependent microbial enzymes. 
Intervention in transcription control has been demonstrated and seems to be a 
plausible mode of action for structurally diverse minor groove binders. B. P. Das, et 
10 al. , J. Med. Chem. 20, 531-536 (1977); D. W. Boykin, et al.,J. Med. Chem. , 36, 912- 
916 (1995); A. Kumar et al., Eur. J. Med. Chem. 31, 767-773 (1996); R. J. Lombardy, 
et al., J. Med. Chem. 31, 912-916 (1996); RR. Tidwell. et zl, Antimicrob. Agents 
Chemother. 37, 1713-1716 (1993); R. R. Tidwell, R.R. and C. A. Bell, "Pentamidine 
and Related Compounds in Treatment of Pneumocystis carinii Infection," in 
15 Pneumocystis carinii, (Marcel Decker; New York, 561-583 (1993)); D. Henderson, 
and L.H. Hurley, Nature Med. 1, 525-527 (1995); J. Mote, Jr., et al, J. Mol Biol. 
226, 725-737 (1994); and D. W. Boykin, et al., J. Med. Chem. 41,124-129(1998). 

Organic cations that bind in the DNA minor groove also have biological 
activities that range from anti-opportunistic infection to anticancer properties. See 
20 e.g., C. Bailly, in Advances in DNA Sequence-Specific Agents, Vol. 3, pp. 97-156 (L. 
H. Hurley, Ed. JAI Press Inc., London, UK, 1998); J. A. Mountzouris and L. H. 
Hurley, in Bioorganic Chemistry: Nucleic Acids, pp. 288-323, (S. M. Hecht, Ed.., 
Oxford Univ. Press, New York, 1996); E. Hildebrant, et al.,7. Euk. Microbiol. 45, 
1 12 (1998); and K. Hopkins et al., J. Med. Chem. 41, 3872 (1998). Such compounds 
25 have provided a wealth of fundamental information about nucleic acid recognition 
properties, and they continue to be important models in the study of nucleic acid 
complexes. 

The DNA minor-groove and AT sequence recognition properties of molecules 
of this series have been probed extensively for more than 30 years. See, e.g., C. 
30 Zimmer and U. Wahnert, Prog. Biophys. Mol. Biol. 47, 3 1 (1986); B. H. Geierstanger 
and D. E. Wemmer, Annu. Rev. Biophys. Biomol. Struct. 24, 463 (1995); W. D. 
Wilson, in Nucleic Acids in Chemistry and Biology, Chapter 8 (G. M. Blackburn and 
M. J. Gait, Eds., IRL Press, Oxford, U.K., 1996). The compound netropsin (see FIG. 
1 ) was the first minor groove-binding compound crystallized with a B-form DNA, 
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and the structure of the complex provided clear suggestions about the molecular basis 
for AT base pair sequence-specific recognition. M. L. Kopka, et al., Proc. Natl 
Acad. Sci. 82, 1376 (1985). The structure of netropsin also led to the development of 
minor-groove binding netropsin analogs, the lexitropsins, that could specifically 
5 recognize GC base pairs and could thus have extended sequence recognition 

capability. See, J. W. Lown et al, Biochemistry 25, 7408 (1986); M. L. Kopka and T. 
A. Larsen, in Nucleic Acid Targeted Drug Design, pp. 303-3 74C (L. Probst and T. J. 
Perun. Eds., Marcel Dekker Inc., New York, 1992); and M. L. Kopka et al., Structure 
5, 1033 (1997). Initial efforts in the design of such analogs did provide compounds 
10 with enhanced recognition of GC base pairs, but unfortunately, the specificity 

obtained was not significant. A breakthrough in this area occurred with the discovery 
that the monocationic compound distamycin (FIG. 1) could bind into the minor 
groove of some AT sequences of DNA as a stacked, antiparallel dimer. See J. G. 
Pelton and D. E. Wemmer, Proc. Natl Acad. Sci. 86, 5723 (1989), and J. G. Pelton 
15 and D. E. Wemmer, J. Am. Chem. Soc. 112, 1393 (1990). 

One of the early recognition principles for AT sequences was the fact that the 
minor groove is narrower in AT than in GC regions, and it is perhaps the most 
surprising feature of the dimer complex that the minor groove in B-form DNA can 
readily expand to the width required for dimer binding. The expansion of the groove 
20 not only allows the dimer to bind but also provides for recognition of both strands in 
the duplex through complementary strand recognition by the two molecules of the 
dimer. Replacement of pyrrole group in distamycin by imidazole provided improved 
GC recognition specificity with dimer complexes and current design efforts in this 
system have reached a high level of success. See e.g., C. L. Kielkopf, et ah, Nature 
25 Struct. Biol 5, 104 (1998); S. Whiteet al., Nature 391, 468 (1998); C. L. Kielkopf et 
al., Science 282, 111 (1998); S. E. Swalleyet al., J. Am. Chem. Soc 121, 1113 (1999); 
and D. M. Herman, et al., J. Am. Chem. Soc 121, 1 121 (1999). With recent 
incorporation of hydroxypyrole groups as a recognition unit, AT and TA as well as 
GC and CG base pairs can now be effectively distinguished in DNA sequences by 
30 pyrrole-imidazole polyamides related to distamycin. 

The pyrrole-imidazole polyamide system is the only one of the well-known 
minor-groove binding motifs that has been found to form the stacked-dimer 
recognition unit. Even netropsin, the first minor-groove binding agent to be 
structurally characterized in detail and a dicationic relative of the monocation 
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distamycin (FIG. 1 ), does not form a dimer recognition unit. A recent crystal 
structure of a 2:1 netrospin-DNA complex found that the two netropsm molecules in 
the complex bound in the minor groove as tandem monomer units instead of the side- 
by-side dimer observed with distamycin. See e.g., X. Chen, et al., J. Mol. Biol. 267. 
5 1 157 (1997); X. Chen, et al, Nucleic Acids Res. 26, 5464 (1998); and X. Chen, et al, 
Nature Struct. Biol. 1, 169 (1994). The two charges of netropsin as well as other 
minor groove agents, such as the furan derivatives shown in FIG. 1, have been 
postulated to prevent stacked-dimer formation. 

Recent evidence suggests that some monocationic cyanine dyes can form an 
1 0 array of stacked dimers in the DNA minor groove. See J. L. Seifert, et al., J. Am. 
Chem. Soc. (in press, 1999). There are, however, other monocationic minor-groove 
agents, such as Hoechst 33258 (see FIG. 1 and analogs, that apparently do not form 
dimer DNA recognition motifs. These results indicate that the electrostatic and 
stereochemical requirements for minor-groove recognition of DNA by dimers are 
15 very restrictive, and further suggest that stacked dimer formation by dications is 
unlikely. 



Summary of the Invention 

The present invention is based on the inventors' surprising discovery of a new 
20 class of organic dications, based on unfiised-aromatic systems, that selectively 

recognize mixed DNA sequences {i.e., AT as well as GC base pairs) in a manner that 
is very sensitive to compound structure. These are the first non-peptide compounds 
that have mixed-sequence recognition capability and the result is particularly 
promising, since similar compounds readily enter cells and have generally low 
25 toxicity. See K. Hopkins et al, J. Med. Chem. 41, 3872-3878 (1998). A surprising 
feature of this discovery is that recognition occurs through highly cooperative dimer 
formation at the DNA binding site, a process that has been predicted not to occur for 
dications. The series of compounds provides a synthetically accessible new motif for 
specific recognition of DNA and control of gene expression. Such compounds 
30 accordingly find use in numerous therapies and treatments, including the treatment 
and prevention of opportunistic infections, cancer and other diseases of cell 
proliferation, and disorders of genetic origin (i.e., diseases caused by mutations of 
DNA and the like). Additionally, certain of the compounds of the present invention 
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are fluorescent, and thus are useful for the detection of certain specific sequences 
recognized by the compounds of the invention. 



Accordingly, a first aspect of the present invention is a compound of Formula 

5 I: 




R4 



wherein: 

X is selected from the group consisting of O, S, and NH; 

YisCHorN; 

A is CH or N; 

10 B is selected from the group consisting of NH, O or S; 

Rj is selected from the group consisting of H, loweraJkyl, halogen, oxyalkyl, 
oxyaryl, and oxyarylakyl; 

R 2 and R 9 are each independently selected from the group consisting of H, H 2 , 
hydroxy, lower alkyl, cycloalkyl, aryl, alkylaryl, alkoxyalkyl, hydroxycycloalkyK 
1 5 alkoxycycloalkoxy, hydroxyalkyl, aminoalkyl and alkylaminoalkyl; and 

R3, R4, R13 and R, 4 are each independently selected from the group consisting of 
H, lower alkyl, alkoxyalkyl, cycloalkyl, aryl, alkylaryl, hydroxyalkyl, aminoalkyl, and 
alkylaminoalkyl, or R 3 and R, together or R 13 and R 14 together represent a C 2 to C10 
alkyl, hydroxyalkyl, or alkylene, or R, and R4 together or R 13 and R 14 together are: 




wherein n is a number from 1 to 3, and R I0 is H or -CONHR n NR 15 R 16 , wherein 
Ri 1 is lower alkyl and R 15 and R ]6 are each independently selected from the 
group consisting of H and lower alkyl; 
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L is selected from the group consisting of: 




wherein R5, R$, R7, and Rg are each individually selected from the group 
5 consisting of H, alkyl, halo, aryl, arylalkyl, aminoalkyl, aminoaryl, oxoalkyl, oxoaryl, 
and oxoarylalkyl; and wherein said compound of Formula I binds mixed-sequence 
DNA in the minor groove in a dimer formation. In a preferred embodiment of the 
invention, the compound of Formula 1 is a dication, L is: 

10 
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A is N; B is NH; X is O; Y is CH; R u Ri, R4, R5, R 6 , R?, Rs ; R<> and R 14 are each H; 
5 and R5 and R13 are each H2. 

A second aspect of the present invention is a method of selectively binding 
mixed sequence DNA comprising contacting a sample of DNA with a compound of 
Formula I. 

A third aspect of the present invention is a method of detecting mixed DNA 
10 sequences comprising contacting a sample of DNA with a fluorescent compound of 
Formula I, and then observing fluorescence in the sample, the observation of 
fluorescence indicating that mixed DNA sequences have been bound. 

A fourth aspect of the invention is a pharmaceutical formulation comprising a 
compound of Formula I in a pharmaceutically acceptable carrier. 
1 5 Additional aspects of the invention include methods of controlling gene 

expression, methods of treating microbial infection, methods of treating cancer and other 
disorders of cell proliferation, and methods of treating disorders of genetic origin (i.e., 
where the disease state is caused by a gene mutation or mutations). 

Other aspects of the present invention include the use of an active compound 
20 as described above for the preparation of a medicament for controlling gene 

expression, or medicament for treating a microbial infection, or a method of treating a 
disorder of genetic origin in a subject in need thereof. 

The foregoing and other aspects of the present invention are explained in 
detail in the specification set forth below. 

25 

Brief Description of the Drawings 

FIG 1. sets forth the chemical structures for the minor-groove binding 
compounds netropsin, distamycin, Hoechst 33258, furamidine (DB75), DB270, and 
DB293. FIG. 1 also sets forth the DNA sequences for oligol. oligo2, oligo2-l and 
30 oligo2-2, as described herein. 
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FIG. 2. illustrates the results of a quantitative DNase 1 footprinting titration 
experiment with the compound DB293 on the 265 bp DNA fragment as described 
herein. The EcoKl-Pvull restriction fragment from plasmid pBS was 3'-end labeled at 
the EcoRl site with [a- 32 P]dATP in the presence of AMV reverse transcriptase. As 
5 illustrated in FIG* 2 A, the products of the DNase I digestion were resolved on an 8% 
polyacrylamide gel containing 8M urea. Drug concentrations are (lanes 1-11)0, 0.3, 
0.6, 0.9, 1.2, 1.5, 1.8, 2.1, 2.4, 2.7, 3.0 jiM for DB 293 and (lanes 12-15) 0, 1, 2 and 5 
jiM for DB270. Tracks labeled 'G' represent dimethylsulphate-piperidine markers 
specific for guanines. The track labeled DNA contained no drug and no enzyme. 
1 0 Numbers at the right side of the gel refer to the numbering scheme of the fragment. 
The rectangles on the left side refer to the positions of (open box) an AT-rich and 
(filled box) a GC-rich binding site for DB293. FIG. 2B is a graphical illustration of 
footprinting plots for the binding of DB293 to (open circles) the AT site 5 -AATTAA 
and (filled squares) the GC-rich site S'-ACCATG. The relative band intensity R 

1 5 corresponds to the ratio lc/Io where I c is the intensity of the band at the ligand 

concentration c and I 0 is the intensity of the same band in the absence of DB293. The 
differential cleavage plots shown in FIG. 2C compare the susceptibility of the DNA 
to cutting by DNase I in the presence of (filled circles) 5 DB270 or (open 
squares) 1.5 DB293. Deviation of points towards the lettered sequence (negative 

20 values) corresponds to a ligand-protected site and deviation away (positive values) 
represents enhanced cleavage. The vertical scale is in units of ln(f a ) - ln(f c ), where f a 
is the fractional cleavage at any bond in the presence of the drug and f c is the 
fractional cleavage of the same bond in the control. The results are displayed on a 
logarithmic scale for the sake of convenience. The rectangles below the sequence 

25 show the positions of (open box) the AT binding site and (filled box) the GC-rich site. 
FIG. 3 sets forth Scatchard plots of the results for binding of DB293 and 
DB270 to oligol and oligo2-l along w r ith best fit binding curves are shown: closed 
triangles and open triangles are for DB293 and DB270, respectively, binding to 
oligol . Closed circles and open circles are for DB293 and DB270 binding to oligo2- 

30 1, respectively Because of the weak binding of DB270 to oligo2-l, the results were 
fit with the assumption of a single DB270 binding to the duplex . Sensorgrams with 
the data for this plot are shown in FIG. 6. 
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FIG. 4. is a two-dimensional COSY spectra of the TH6-TCH3 spectral region 
shown for (top, A) free DNA; (middle, B) a 1:1 ratio sample of DB293 to oligo2-l; 
and (bottom, C) a 2:1 ratio. Signals for the free DNA and for the 2: 1 complex in the 
1:1 ratio sample are indicated by connecting lines to the top and bottom spectra. 
5 FIG. 5 illustrates Tm curves as a function of ratio for complexes of DB270 

and DB293 with oligo 2-1. Closed circles indicate free DNA; closed triangles and 
open triangles indicate DB293 at 1:1 and 2:1 ratios, respectively, and closed squares 
and open squares indicate DB270 at 1 :1 and 2:1 ratios, respectively. 

FIG. 6 sets forth sensorgrams for binding of DB270 and DB 293 to (top, A) 
10 oligo2-l and (bottom, B) oligo 1. Drug concentrations range from 1 nM to 1 jiM. 



Detailed Description of the Preferred Embodiments 

The present invention will now be described more fully hereinafter with 
reference to the accompanying drawings, in which preferred embodiments of the 
15 invention are shown. This invention may, however, be embodied in different forms 
and should not be construed as limited to the embodiments set forth herein. Rather, 
these embodiments are provided so that this disclosure will be thorough and complete, 
and will fully convey the scope of the invention to those skilled in the art. 

Unless otherwise defined, all technical and scientific terms used herein have 
20 the same meaning as commonly understood by one of ordinary skill in the art to 

which this invention belongs. All publications, patent applications, patents, and other 
references mentioned herein are incorporated by reference in their entirety. 

Nucleotide sequences are presented herein by single strand only, in the 5 7 to 3' 
direction, from left to right. Nucleotides are represented herein in the manner 
25 recommended by the IUPAC-IUB Biochemical Nomenclature Commission in 
accordance with 37 CFR §1.822 and established usage. See, e.g., Patentln User 
Manual, 99-102 (Nov. 1990) (U.S. Patent and Trademark Office). 

Certain objects, advantages and novel features of the invention will be set 
forth in the description that follows, and will become apparent to those skilled in the 
30 art upon examination of the following, or may be learned with the practice of the 
invention. 

As used herein the term "alky!" refers to Cmo inclusive, linear, branched, or 
cyclic, saturated or unsaturated (i.e., alkenyl and alkynyl) hydrocarbon chains, 
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including for example, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, tert-butyl, 
pentyl, hexyl, octyl, ethenyl, propenyl, butenyl, pentenyl, hexenyl, octenyl, 
butadienyl, propynyl, butynyl, pentynyl, hexynyl, heptynyl, and allenyl groups. As 
used herein, the term "acyl" refers to an organic acid group wherein the -OH of the 
5 carboxyl group has been replaced with another substituent (i.e., as represented by 
RCO — , wherein R is an alkyl or an aryl group). As such, the term "acyl" specifically 
includes arylacyl groups. Specific examples of acyl groups include acetyl and 
benzoyl. As used herein, the term "aryl" refers to 5 and 6-membered hydrocarbon 
and heterocyclic aromatic rings. Specific examples of aryl groups include but are not 

10 limited to cyclopentadienyl, phenyl, furan, thiophene, pyrrole, pyran, pyridine, 

imidazole, isothiazole, isoxazole, pyrazole, pyrazine, pyrimidine, and the like. The 
term "alkoxyl" as used herein refers to Ci_io inclusive, linear, branched, or cyclic, 
saturated or unsaturated oxo-hydrocarbon chains, including for example methoxy, 
ethoxy, propoxy, isopropoxy, butoxy, t-butoxy, and pentoxy. The term "aryloxyl" as 

15 used herein refers to phenyloxyl or hexyloxyl, and alkyl, halo, or alkoxyl substituted 
phenyloxyl or hexyloxyl. As used herein, the terms "substituted alkyl" and 
"substituted aryl" include alkyl and aryl groups, as defined herein, in which one or 
more atoms or functional groups of the aryl or alkyl group are replaced with another 
atom or functional group, including for example, halogen, aryl, alkyl, alkoxy, 

20 hydroxy, nitro, amino, alkylamino, dialkylamino, sulfate, and mercapto. The terms 
"halo," "halide," or "halogen" as used herein refer to fluoro, chloro, bromo, and iodo 
groups. 

As used herein, the term "mixed sequence DNA" refers to a sequence of DNA 
that comprises GC base pairs and AT base pairs. 

25 Compounds of Formula I of the present invention (hereinafter referred to as the 

"active compounds") are useful in binding mixed sequences of DNA, i. e., GC as well as 
AT base pairs. Unexpectedly, the active compounds bind in the minor groove of DNA 
at specific GC containing sequences in a highly cooperative manner as stacked dimers. 
Because of the ability of the compounds of the present invention to bind to specific and 

30 mixed sequences of DNA, they are useful in controlling gene expression by, for 

example, intervening in gene transcription. Accordingly, the active compounds may 

find pharmaceutical use in the treatment of opportunistic infections such Pneumocystis 

carinii, in the treatment of cancers and other disorders of proliferation, and in the 

treatment of genetic disorders caused by, for example, mutations in particular genes 
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{e.g., cystic fibrosis, adult polycystic disease, Huntington's disease, myotonic dystrophy, 
neurofibromatosis, etc.). Moreover, since certain compounds of the present invention 
are fluorescent (i.e., DB293 ; shown in FIG. 1), they are useful in detecting the particular 
DNA sequences bound by the compounds through fluorescence detection methods 
5 known in the art. 

The active compounds of the present invention may be prepared by the methods 
set forth in K. Hopkins et alj. Med. Chem. 41, 3872-3878 (1998). The active 
compounds of the present invention may also be prepared by the methods set forth in R. 
Kadaetal., Collect. Czech. Chem. Comm. 38, 1700-1704 (1973), modified as described 

10 below, the disclosure of which is also incorporated herein in its entirety. Additionally, 
the active compounds may be administered as pharmaceutically acceptable salts. Such 
salts include the gluconate, lactate, acetate, tartarate, citrate, phosphate, borate, nitrate, 
sulfate, and hydrochloride salts. The salts of the present invention may be prepared, in 
general, by reacting two equivalents of the base compound with the desired acid, in 

1 5 solution. After the reaction is complete, the salts are crystallized from solution by the 
addition of an appropriate amount of solvent in which the salt is insoluble. 

As noted above, the methods of the present invention are useful for treating 
opportunistic microbial infections such as, for example, P. carinii and Giardia 
lamblia. The compounds may also be useful in treating fungal infections such as 

20 Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Fusarium 

solani, and combinations thereof. The methods of the invention are useful for treating 
these conditions in that they inhibit the onset, growth, or spread of the condition, 
cause regression of the condition, cure the condition, or otherwise improve the general 
well-being of a subject afflicted with, or at risk of contracting the condition. 

25 The compounds of the present invention are useful not only in methods for 

treating infections and other disorders, but also in methods of inhibiting enzymes such as 
topoisomerase. 

Subjects to be treated by the methods of the present invention are typically 
human subjects, although the methods of the present invention may be useful with any 
30 suitable subject known to those skilled in the art. 

As noted above, the present invention provides pharmaceutical formulations 

comprising the aforementioned active compounds, or pharmaceutically acceptable 

salts thereof, in pharmaceutically acceptable carriers for oral, intravenous, or aerosol 

administration as discussed in greater detail below. Also, the present invention 
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provides such compounds or salts thereof which have been lyophilized and which 
may be reconstituted to form pharmaceutically acceptable formulations for 
administration, as by intravenous or intramuscular injection. 

The therapeutically effective dosage of any specific compound, the use of 
5 which is in the scope of present invention, will vary somewhat from compound to 
compound, and patient to patient, and will depend upon the condition of the patient 
and the route of delivery. As a general proposition, a dosage from about 0. 1 to about 
50 mg/kg will have therapeutic efficacy, with all weights being calculated based upon 
the weight of the active compound, including the cases where a salt is employed. 
10 Toxicity concerns at the higher level may restrict intravenous dosages to a lower level 
such as up to about 1 0 mg/kg, with all weights being calculated based upon the weight 
of the active base, including the cases where a salt is employed. A dosage from about 
10 mg/kg to about 50 mg/kg may be employed for oral administration. Typically, a 
dosage from about 0.5 mg/kg to 5 mg/kg may be employed for intramuscular 
15 injection. Preferred dosages are 1 pmol/kg to 50 |Jmol/kg, and more preferably 22 

pmol/kg and 33 \lmoVkg of the compound for intravenous or oral administration. The 
duration of the treatment is usually once per day for a period of two to three weeks or 
until the condition is essentially controlled. Lower doses given less frequently can be 
used prophylactically to prevent or reduce the incidence of recurrence of the infection. 
20 In accordance with the present method, pharmaceutically active compounds as 

described herein, or pharmaceutically acceptable salts thereof, may be administered 
orally as a solid or as a liquid, or may be administered intramuscularly or 
intravenously as a solution, suspension, or emulsion. Alternatively, the compounds 
or salts may also be administered by inhalation, intravenously or intramuscularly as a 
25 liposomal suspension. When administered through inhalation the active compound or 
salt should be in the form of a plurality of solid particles or droplets having a particle 
size from about 0.5 to about 5 microns, and preferably from about 1 to about 2 
microns. 

The present invention also provides a pharmaceutical composition suitable for 

30 intravenous or intramuscular injection. The pharmaceutical composition comprises a 

compound of Formula (I) described herein, or a pharmaceutically acceptable salt 

thereof in any pharmaceutically acceptable carrier. If a solution is desired, water is 

the carrier of choice with respect to water-soluble compounds or salts. With respect 

to the water-insoluble compounds or salts, an organic vehicle, such as glycerol, 
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propylene glycol, polyethylene glycol, or mixtures thereof, may be suitable. In the 
latter instance, the organic vehicle may contain a substantial amount of water. The 
solution in either instance may then be sterilized in a suitable manner known to those 
in the art, and typically by filtration through a 0.22 micron filter. Subsequent to 
5 sterilization, the solution may be dispensed into appropriate receptacles, such as 
depyrogenated glass vials. Of course, the dispensing is preferably be done by an 
aseptic method. Sterilized closures may then be placed on the vials and, if desired, 
the vial contents may be lyophilized. 

In addition to compounds of Formula (I) or their salts, the pharmaceutical 
10 compositions may contain other additives, such as pH-adjusting additives. In 

particular, useful pH-adjusting agents include acids, such as hydrochloric acid, bases 
or buffers, such as sodium lactate, sodium acetate, sodium phosphate, sodium citrate, 
sodium borate, or sodium gluconate. Further, the compositions may contain microbial 
preservatives. Useful microbial preservatives include methylparaben, propylparaben, 
15 and benzyl alcohol. The microbial preservative is typically employed when the 

formulation is placed in a vial designed for multidose use. Of course, as indicated, the 
pharmaceutical compositions of the present invention may be lyophilized using 
techniques well known in the art. 

In yet another aspect of the present invention, there is provided an injectable, 
20 stable, sterile composition comprising a compound of Formula (I), or a salt thereof, in 
a unit dosage form in a sealed container. The compound or salt is provided in the 
form of a lyophilizate which is capable of being reconstituted with a suitable 
pharmaceutically acceptable carrier to form a liquid composition suitable for injection 
thereof into a subject. The unit dosage form typically comprises from about 10 mg to 
25 about 10 grams of the compound or salt. When the compound or salt is substantially 
water-insoluble, a sufficient amount of emulsifying agent which is physiologically 
acceptable may be employed in sufficient quantity to emulsify the compound or salt 
in an aqueous carrier. One such useful emulsifying agent is phosphatidyl choline. 

Other pharmaceutical compositions may be prepared from the water-insoluble 
30 compounds disclosed herein, or salts thereof such as aqueous base emulsions. In 
such an instance, the composition will contain a sufficient amount of 
pharmaceutically acceptable emulsifying agent to emulsify the desired amount of the 
compound or salt thereof. Particularly useful emulsifying agents include phosphatidyl 
cholines, and lecithin. 
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Further, the present invention provides liposomal formulations of the 
compounds disclosed herein and salts thereof. The technology for forming liposomal 
suspensions is well known in the art. When the compound or salt thereof is an 
aqueous-soluble salt, using conventional liposome technology, the same may be 
5 incorporated into lipid vesicles. In such an instance, due to the water solubility of the 
compound or salt, the compound or salt will be substantially entrained within the 
hydrophilic center or core of the liposomes. The lipid layer employed may be of any 
conventional composition and may either contain cholesterol or may be cholesterol- 
free. When the compound or salt of interest is water-insoluble, again employing 

1 0 conventional liposome formation technology, the salt may be substantially entrained 
within the hydrophobic lipid bilayer which forms the structure of the liposome. In 
either instance, the liposomes which are produced may be reduced in size, as through 
the use of standard sonication and homogenization techniques. 

Of course, the liposomal formulations containing the compounds disclosed 

1 5 herein or salts thereof, may be lyophilized to produce a lyophilizate which may be 

reconstituted with a pharmaceutically acceptable carrier, such as water, to regenerate a 
liposomal suspension. 

Pharmaceutical formulations are also provided which are suitable for 
administration as an aerosol, by inhalation. These formulations comprise a solution or 

20 suspension of a desired compound described herein or a salt thereof, or a plurality of 
solid particles of the compound or salt. The desired formulation may be placed in a 
small chamber and nebulized. Nebulization may be accomplished by compressed air 
or by ultrasonic energy to form a plurality of liquid droplets or solid particles 
comprising the compounds or salts. The liquid droplets or solid particles should have 

25 a particle size in the range of about 0.5 to about 1 0 microns, more preferably from 
about 0.5 to about 5 microns. The solid particles can be obtained by processing the 
solid compound or a salt thereof, in any appropriate manner known in the art. such as 
by micronization. Most preferably, the size of the solid particles or droplets will be 
from about 1 to about 2 microns. In this respect, commercial nebulizers are available 

30 to achieve this purpose. The compounds may be administered via an aerosol 

suspension of respirable particles in a manner set forth in U.S. Patent No. 5,628,984, 

the disclosure of which is incorporated herein by reference in its entirety. 

Preferably, when the pharmaceutical formulation suitable for administration as 

an aerosol is in the form of a liquid, the formulation will comprise a water-soluble 
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compound or a salt thereof, in a carrier which comprises water. A surfactant may be 
present which lowers the surface tension of the formulation sufficiently to result in the 
formation of droplets within the desired size range when subjected to nebulization. 
As indicated, the present invention provides both water-soluble and water- 
5 insoluble compounds and salts thereof As used in the present specification the term 
"water-soluble" is meant to define any composition which is soluble in water in an 
amount of about 50 mg/mL, or greater. Also, as used in the present specification, the 
term "water-insoluble" is meant to define any composition which has solubility in 
water of less than about 20 mg/mL. For certain applications, water soluble 
10 compounds or salts may be desirable whereas for other applications water-insoluble 
compounds or salts likewise may be desirable. 

The following Examples are provided to illustrate the present invention, and 
should not be construed as limiting thereof. 



15 EXAMPLE 1 

Synthesis of Compounds of Formula I 

2-[5(6)-Nitro-2-benzimidazoyl]-5-(4-nitrophenyl)furan was prepared 
according to a modified literature procedure (R. Kada et al., Collect. Czech. Chem. 

20 Comm. 38, 1 700- 1 704 (1 973)) by reaction of 5-(4-nitrophemyl)furfural (10 mmol) 
with 4-nitro-l,2-phenylenediamine (10 mmol) in a mixture of DMF (25 ml) and 
nitrobenzene (5 ml) at 150 °C for 22 h (under nitrogen). Cooling to room-temperature 
gave a suspended solid which was diluted with MeOH (30 ml), collected, and finally 
rinsed well with ether. Yield: 2.56 g, 73%; mp 350-351 °C dec; lit mp 348-350 °C). 

25 ] H NMR (DMSO-40: 7.51 (d, 7= 3.7 Hz, 1H), 7.57 (d, J= 3.7 Hz, 1H), 7.78 (d, J= 
8.9 Hz, 1H), 7.94 (s, 1H), 8.14 (dd, J= 8.9, 2.2 Hz, 1H), 8.17 (d, /= 8.8 Hz 5 2H), 
8.36 (d, J = 9.1 Hz, 2H), 8.47 (d, J= 1 .7 Hz, 1H) (benzimidazole NH not observed). 

2-[5(6)-Amino-2-benzimidazoyl]-5-(4-aminophenyl)furan. To a suspension 
of 2-[5(6)-nitro-2-benzimidazoyl]-5-(4-nitrophenyl)furan (2.63 g, 7.5 mmol) in EtOH 

30 (100 ml) was added stannous chloride dihydrate (16.0 g, 71 mmol) and the mixture 

was refluxed under nitrogen with vigorous stirring for 3 hr to give a solution. After 

stirring at room-temperature overnight, the solution was made basic by addition of 

aqueous NaOH and the solids were extracted with EtOAc. After drying (Na2SC>4) and 

filtering, the solvent was removed in vacuo and the residue was dissolved in EtOH. 

35 This solution was then diluted with water to give a greenish yellow solid which was 
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collected and dried in the desiccator (P 2 0 5 ). Yield: 0.95 g, 44%; mp 161-165 °C dec. 
In contrast to the bis-nitro derivative, the ] H NMR of this bis-amine was quite 
complex indicating it exists as a mixture of the two possible tautomers. The 'H NMR 
of the hydrochloride salt, prepared by dissolving a sample of the free base in 
5 HCl/EtOH followed by concentration, was less complex (DMSOrf 6 , D 2 0): 7.17 (d, J 
= 8.6 Hz, 2H), 7.21 (d, 7= 3.7 Hz, 1H), 7.30 (dd, J = 8.7, 1.9 Hz, 1H), 7.63 (d, J = 1.9 
Hz, 1H), 7.74 (d, J= 8.6 Hz, 1H), 7.78 (d, J= 3.8 Hz, 1H), 7.94 (d, J = 8.6 Hz, 2H). 

2-[5(6)-Guanidino-2-benzimidazoyl]-5-(4-guanidinophenyl)furan. To a 
chilled solution of 2-[5(6)-amino-2-benzimidazoyl]-5-(4-aminophenyl)fiiran (0.363 
10 g. 1.25 mmol) and l,3-bis(/er/-butoxycarbonyl)-2-methyl-2-thiopseudourea (0.755 g, 
2.60 mmol) in dry DMF (25 ml) was added triethylamine (0.78 g, 7.71 mmol) 
followed by mercury(II) chloride (0.78 g, 2.87 mmol) and the resulting suspension 
was stirred at ambient temperature for 3 days. After diluting with CH 2 C1 2 and 
filtering over Celite, the dark solution was washed well with saturated Na2C03 
1 5 solution, with water (3 times), and finally with brine. After drying (Na2SC>4), the 
solvent was removed in vacuo and the remaining oil was diluted with MeOH to give 
the BOC-protected bis-guanidine as a pale green solid in two crops (0.58 g). The 
product was purified by reprecipitation from C^Ch/MeOH to give, after partial 
concentration, a fluffy pale green solid (0.42 g, 43%), mp >400 °C dec, with 
20 darkening >300°C. 

For deprotection, a solution of the protected bis-guanidine in CHC1 3 (12 ml) 
and EtOH (10 ml) was saturated with dry HC1 at 0-5 °C and allowed to stir for 2 days 
at room-temperature to give a orange-colored suspension. After removing the 
solvents in vacuo, the solid was taken up in hot EtOH (60 ml), a small amount of 
25 insoluble material was filtered off, and the solvent was again removed. After 

trituration with ether, the yellow solid was collected and dried in vacuo for 3 days at 
50-60 °C. Yield: 0.24 g, 92% (40% overall from the bis-amine). , HNMR(DMSO- 
d 6 ): 7.24 (d, J= 8.6 Hz, 1H), 7.33 (d, J = 3.6 Hz, 1H), 7.38 (d,7= 8.6 Hz, 2H), 7.51 
(br s, 3H), 7.57 (s, 1H), 7.64 (br s, 3H), 7.68 (apparent s, 1H), 7.74 (d, J= 8.6 Hz, 
30 lH),8.05(d,J=8.5Hz,2H), 10.05 (brs, 1H), 10.19 (brs, 1H). FABMS 

(thioglycerol): m/z 375 (100). FABHRMS :Calcd. for C 19 H, 8 N 8 0 (MH + ) : 375.1682. 
Found: 375.1670. Anal Calcd for C,9H, 8 N 8 0.3HC1*2H 2 0: C, 43.90; H, 4.85; N, 
21.56; CI, 20.46. Found: C, 43.68; H, 4.47; N, 20.68; CI, 20.46. 
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l-[ (5-Bromobenzo[b]furan-2-ylJ-3-dimethylaminopropane 
hydrochloride. A mixture of 2-acetyl-5-bromobenzo[b]furan (23.9 g, 0.1 mol), 
dimethylamine hydrochloride( 8.15 g, 0.1 mol), paraformaldehyde (3.6 g) and 2 ml of 
35% hydrochloric acid in 150 ml of ethanol was heated at reflux for 20 h (TLC 
5 followed). The solvent volume was reduced under reduced pressure to 50 ml and a 
mixture of acetone: ether (1 :2) was added and the resultant solid was filtered, washed 
with ether and dried at 45°C in a vacuum oven for 24 h to yield 23.0 g (69%), mp 
185-187°C dec. 'H NMR (DMSO-</ 6 ): 8.07 (d, 7= 2.0 Hz, 1H), 7.91 (s, 1H), 7.70 (d, 
7= 8.8 Hz, 1H), 7.66 (dd, J = 2.0 Hz, J = 8.8 Hz, 1H), 3.58 (t, J - 7.2, 2H), 3.41 (t, J = 
10 7.2,2H),2.78(s,6H). 13 C NMR (DMSO-cfo)- 186.7, 153.7, 152.2, 131.2, 128.8, 126.0, 
1 16.2, 114.3, 1 13.5, 51.0, 42.2, 33.3. The presence of small amount (ca. 5%) of the 
corresponding elimination product (vinyl ketone) was apparent from the ] H NMR; the 
product was used directly in the next step with out further purification. 

1- [(5-Bromobenzo[b]furan-2-yl]-4-(4-bromophenyl)butane-l,4-dione. A 

1 5 mixture of the above Mannich base (16.6 g, 0.05 mol), 3-benzyl-5(2-hydroxyethyl)-4- 
methyl thiazolium chloride catalyst( 0.68, 0.0025 mol), triethylamine (15.15 g, 0.15 
mol) and 4-bromobenzaldehyde (9.25 g, 0.05 mol) in 180 ml dioxane was heated at 
reflux for 12 h (under nitrogen). The solvent was removed under reduced pressure and 
the residue was treated with water. The resultant gummy material was extracted with 

20 150 ml of chloroform. The organic layer was dried over MgS0 4 and the solvent was 
removed under reduced pressure. The residue was treated with EtOH:ether( 1:1) the 
solid which remained was filtered, washed with ether and dried to yield 7.4 g(34%); 
mp 176-177°C. ). *H NMR (DMSO-J 6 ): 8.03 (dd, 7= 0.4 and 1.5 Hz, 1H), 7.91 (d 5 J 
= 8.4 Hz, 2H), 7.82(d, J =0.4, 1H) 7.72 (d, J = 8.4 Hz, 2H), 7.68 (d, J = 8.8, 1H) 7.64 

25 (dd,J = 1.5 and 8.8 Hz, 1H), 3.40-3.45 (m, 2H), 3.37-3.33 (m, 2H). 13 CNMR 
(DMSO-rf 6 ): 197.3,188.9, 153.5, 152.7, 135.2, 131.5, 130.7, 129.6, 128.8, 127.0, 
125.7,115.9,114.1,112.4,32.3, 32.0. MS m/c 436 (M + ). Anal. Calcdfor 
Ci8H 12 Br 2 0 3 C, 49.57; H, 2.77. Found: C, 49.49; H, 2.74. 

2- [(5-Bromobenzo[b]furan-2-ylJ-5-(4-bromophenyl)furan. A solution of 
30 the above diketone (8.72 g, 0.02 mol) in 150 ml CHCh:MeOH( 1 :1) was saturated 

with HC1 gas, stirred at room temperature for 4h ( TLC followed). The solvent was 
removed under reduced pressure and the residue was stirred with 200 ml 10% 
aqueous NaHC0 3> filtered, washed with water, dried and recrystallized from ether: 
CH 2 C1 2 (4: 1) to yield white solid 7.1 g( 85%) mp 204-206T. ] H NMR (DMSO-</ 6 ): 
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7.86 (d, J =2.0. 1H)), 7.76 (d, J= 8.4 Hz, 2H), 7.65(d, J = 8.4, 2H) 7.58 (d, J = 8.4 Hz, 
1H), 7.45 (dd, J= 2.0 and 8.4 Hz, 1H) 7.23 (s, 1H), 7.17 (d, J = 4.0 Hz, 1H), 7.1 l(d, J 
= 4.0 Hz, 1H). ,3 CNMR(DMSO-tf 6 ): 152.9, 152.7, 148.0, 144.1,131.6, 130.4, 128.4, 

127.0. 125.5, 123.3, 120.9 115.5, 112.7,111.2, 108.6, 101.1. MS m/e 436(M + ). Anal. 
5 Calcd for Ci 8 H, 2 Br 2 0 3 C, 49.57; H, 2.77. Found: C, 49.49; H, 2.74. 

2-[(5-Cyanobenzo[b]furan-2-yl]-5-(4-cyanophenyl)furan. A mixture of the 
above dibromo compound (8.36 g, 0.02 mol) and CuCN( 5.34 g, 0.06 mol) in 60 ml of 
N-methyl-2-pyrolidinone was heated at reflux for 4h ( under nitrogen), cooled, diluted 
with water and stirred with 200 ml of 10% aqueous NaCN for 3 h. The solid was 

10 filtered, washed with water and dried. The crude product was dissolved in 

CHCI 3 :MeOH( 1 : 1 ) and chromatographed over neutral alumina to yield a pale yellow 
solid 4.35g(70%), mp 247-248°C. 'H NMR (DMSO-</ 6 ): 8.18 (d, J =1.6, 1H)), 7.98 
(d, J = 8.0 Hz, 2H), 7.88(d, J = 8.0, 2H) 7.81 (d, J = 8.4 Hz, 1H), 7.73 (dd, J= 1.6 
and 8.4 Hz, 1H) 7.41 (s, 1H), 7.38 (d,lH, J = 3.6 Hz), 7.21(d,lH, J =3.6 Hz). 13 C 

15 NMR(DMSO-</ 6 ): 155.6, 152.4, 148.4, 144.7,132.9, 132.6, 128.8, 128.3, 126.1, 

124.1, 118.6, 118.3,112.3, 111.9, 111.2, 106.4, 101.9. MS m/e 310(M*). Anal Calcd 
for C 2 oH 10 N202 C, 77.41; H, 3.25; N, 9.02. Found: C, 77.41; H, 3.26; N, 8.95. 

2-((5-Amidinobenzo[b]furan-2-yl]-5-(4-amidinophenyl)furan 
dihydrochloride. The above dicyano compound (3.1 g, 0.01 mol) in 70 ml of ethanol 

20 was saturated with dry HC1 gas at 0-5°C and then stirred at room temperature for 8 da 
(monitored by IR and TLC). Ether was added to the mixture and the yellow imidate 
ester dihydrochloride was filtered and washed with ether. The solid was dried at 50°C 
in a vacuum for 24 h, to yield 4.3 g (93%). The solid was used directly in the next step 
without further purification. 

25 A suspension of imidate ester dihydrochloride ( 1 .43 g, 0.003 mol) in 20 ml of 

ethanol was saturated with ammonia gas, stirred for 24 h and the solvent was removed 
under reduced pressure. The solid was suspended in water and the pH was adjusted to 
9 and the ofT-white solid was filtered. The solid was stirred in HC1 saturated ethanol 
and the yellow salt was filtered and dried in a vacuum oven at 75°C for 24 h to yield 

30 0.7 g ( 68%) mp 320 dec. ! H NMR (DMSO-<VD 2 0): 8.20 (d, J =1.2, 1H)), 8.01 (d, 

J = 8.0 Hz, 2H), 7.74(d, J = 8.0, 2H) 7.82 (d, J = 8.4 Hz, 1H), 7.78 (dd, J= 1 .2 and 

8.4 Hz, 1H) 7.47 (s, 1H), 7.37 (d,lH, J = 3.6 Hz), 7.20(d,lH, J =3.6 Hz). 13 C NMR 

(DMSO-rf 6 ): 165.7,164.8, 156.7, 152.8, 148.6, 145.0,134.0, 128.9, 128.7, 126.4, 

124.9, 123.9, 123.3, 122.0,112.1, 1 1 1.8,1 1 1 .2.102.5. FABMS m/e 345(M + +\)Anal 
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Calcd for C 2 oH, 6 N 4 02 •2HCl*0.5H 2 O: C, 56.36; H, 4.49; N, 13.14. Found: C, 56.73; 
H, 4.71; N, 12.71. 

EXAMPLE 2 
DNA Fingerprinting Studies 

5 

In order to characterize the DNA recognition properties of a series of analogs 
of furamidine (shown in FIG. 1), quantitative DNAasel footprinting studies were 
conducted using of a number of derivatives with several different DNA sequences. 
Plasmid DNA restriction fragments were prepared and DNasel footprinting 

10 experiments were conducted as described in C. Bailly, et al., Biochemistry 35, 1 150 
(1996) and C. Bailly et al, Anti Cancer Drug Design (in press, 1999). 

FIG. 2. illustrates the results of a quantitative DNase I footprinting titration 
experiment with the compound DB293 on a 265 bp DNA fragment as described 
herein. The EcoRl-PwU restriction fragment from plasmid pBS was 3'-end labeled at 

15 the EcdRI site with [<x- 32 P]dATP in the presence of AMV reverse transcriptase. As 
illustrated in FIG. 2A, the products of the DNase I digestion were resolved on an 8% 
polyacrylamide gel containing 8M urea. Drug concentrations are (lanes 1-1 1) 0, 0.3, 
0.6, 0.9, 1.2, 1.5, 1.8, 2.1, 2.4, 2.7, 3.0 nM for DB 293 and (lanes 12-15) 0, 1, 2 and 5 
\xM for DB270. Tracks labeled f G ? represent dimethylsulphate-piperidine markers 

20 specific for guanines. The track labeled DNA contained no drug and no enzyme. 
Numbers at the right side of the gel refer to the numbering scheme of the fragment. 
The rectangles on the left side refer to the positions of (open box) an AT-rich and 
(filled box) a GC-rich binding site for DB293. FIG. 2B is a graphical illustration of 
footprinting plots for the binding of DB293 to (open circles) the AT site 5*-AATTAA 

25 and (filled squares) the GC-rich site 5'-ACCATG. The relative band intensity R 
corresponds to the ratio la% where I c is the intensity of the band at the ligand 
concentration c and I 0 is the intensity of the same band in the absence of DB293. The 
differentia] cleavage plots shown in FIG. 2C compare the susceptibility of the DNA 
to cutting by DNase I in the presence of (filled circles) 5 \iM DB270 or (open 

30 squares) 1.5 uM DB293. Deviation of points towards the lettered sequence (negative 
values) corresponds to a ligand-protected site and deviation away (positive values) 
represents enhanced cleavage. The vertical scale is in units of ln(f a ) - ln(f c ), where f a 
is the fractional cleavage at any bond in the presence of the drug and f c is the 
fractional cleavage of the same bond in the control. The results are displayed on a 
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logarithmic scale. The rectangles below the sequence show the positions of (open 
box) the AT binding site and (filled box) the GC-rich site. 

Results with the symmetric compounds furamidine and the bisbenzimidazole 
DB270 are as expected for AT specific minor- groove binding agents and agree with 
5 observations on other furan derivatives and related compounds. With the 

asymmetrical compound DB293 (FIG. 1), however, the footprinting results present a 
number of surprises in the form of strong footprints in unexpected GC-rich regions, as 
shown in FIG. 2. In the 90-100 base region of the 265mer pBS fragment in Fig. 2, for 
example, DB293 gave a very strong footprint while DB270 and furamidine give 

10 negligible footprints. The most surprising feature of the footprint in this sequence 
region is its GC content relative to the AT rich sequences, where footprints are 
usually observed with minor-groove agents 

Quantitative analysis of the footprinting data reveals that the Cso value, the 
drug concentration required for half-maximal footprinting, at the ATGA site is 

15 significantly lower than at the neighboring ATTA site, indicating that DB293 prefers 
the site including a GC base pair over the site containing only AT base pairs. The 
differential cleavage plots show that both DB270 and DB293 bind similarly to sites 
composed exclusively of AT base pairs (FIG. 2). Footprinting studies with several 
restriction fragments showed DB293, but not DB270, strongly binds to sites 

20 containing GC base pairs, such as ATGA, ACGA, and ATGT. 

EXAMPLE 3 
Thermal Melting Experiments 

25 In order to investigate the complexes of these compounds in more detail with 

GC rich sequences, a hairpin duplex model containing the 93-104 base sequence 
region from the 265mer pBS restriction fragment was synthesized and is illustrated as 
oligo2 in FIG. 1. Oligol (also shown in FIG. 1) with the AATT sequence that has 
been used in the analysis of a large number of minor-groove agents provides a 

30 reference. 

Thermal melting experiments were conducted with a Cary 4 
spectrophotometer interfaced to a microcomputer. A thermistor fixed into a reference 
cuvette was used to monitor the temperature. The oligomers were added to 1 mL of 
buffer (0.01 M MES and 0.001M EDTA) in 1 cm path length reduced volume quartz 

35 cells, and the concentration was determined by measuring the absorbance at 260 nm. 
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Experiments were generally conducted at a concentration of 2 x 10~ A M for hairpin 
oligo2, and 3 x 10~ 6 M for hairpin oHgo2-l . Tm experiments for the complexes were 
conducted as a function of ratio. 

Tm determinations of oligo2 on titration with DB293 gave up to a 30°C 
5 increase in Tm and did not level off until a ratio of 4:1 DB293:hairpin duplex had 
been reached. The high ratio of DB293 to oligomer duplex was surprising for a 
duplex of only 13 base pairs. In order to better understand the nature of the complex, 
divided oligo2 was divided into two similar hairpin duplexes, oligo2-l and oligo2-2 
(FIG. 1). As an illustration of the results obtained, derivative Tm curves of DB270 

10 and DB293 complexes with oligo2-l are shown in FIG. 5. The DB293 complex has a 
biphasic melting curve at a 1 :1 ratio with a high temperature phase and a low 
temperature phase near the Tm of the free hairpin duplex. At a 2: 1 ratio, the low 
temperature phase disappears and only the high temperature transition is present. 
Melting curves of DB270 and furamidine complexes with oligo2-l have single 

1 5 transitions at 1 : 1 and 2:1 ratios with melting temperatures below the DB293 value. 
As with the footprinting experiments, these results illustrate the dramatic differences 
in DNA interactions between the symmetric compounds relative to the unsymmetric 
DB293. In addition, the Tm ratio results suggest that the unusual DNA recognition 
properties of DB293 are due to formation of 2:1 complexes with oligo2-l and 2*2, 

20 and a 4:1 complex with oligo2. Such dimer complexes could also explain the 

unexpected footprinting behavior of DB293, however, based on the +2 charge of 
DB293, dimer complexes are not expected. 



EXAMPLE 4 

25 Surface Plasmon Resonance Experiments 

To pursue the comparative quantitative analysis of these compounds with 
DNA in more detail by using surface plasmon resonance, 5'-biotin labeled analogs of 
oligo2-l and 2-2 were immobilized on a BIAcore four-channel streptavidine-coated 
30 sensor chip as follows: Immobilization of DNA and surface plasmon resonance 
(SPR) binding studies: 5'-biotin labeled hairpins were purchased with HPLC 
purification (Midland Co). Samples of the DNA in MES10 buffer (0.01M MES and 
0.001M EDTA, with 0.1M NaCl) at 50nM concentration were applied to a BIAcore 
SA (streptavidin) chip by direct flow at Sjal/min in a BIAcore 2000 SPR instrument. 
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Nearly the same amount of oligol, oligo2-l and oligo2-2 were immobilized on the 
surface of the S A chip. Steady state analysis was performed with multiple injections 
of different concentrations of each compound over the SA surface at a flow rate of 
20^1/min, at 25°C. 

5 Oligol was immobilized as a control sequence and one flow cell was left as an 

unmodified reference. Binding results from the SPR experiments were fit with either 
a single site model (K 2 = 0) or with a two site model: r = (K, *C frcc + 2* Ki*K 2 *Cf ree 2 ) 
/ (1 + Ki*Cfree + 2*K]*K 2 *Cf ree 2 ) where r represents the moles of bound compound 
per mole of DNA hairpin duplex, Ki and K 2 are macroscopic binding constants, and 

10 Cfrce is the free compound concentration in equilibrium with the complex. The free 
compound is fixed by the concentration in the flow solution. Binding of all of the 
furan derivatives to oligol is best fit by the single site model, while binding of DB293 
to oligo2-l and 2-2 requires the two site model and K 2 is found to be much greater 
than K] as expected for interactions with very large positive cooperativity. Oligos 1 

15 and 2-1 are shown in FIG. 3 to illustrate the differences. 

The binding of all of the furan compounds to oligol is similar and saturation is 
reached at a 1 :1 ratio, as expected from results with a number of minor-groove 
binding cations with DNA duplexes containing an AATT sequence. The results for 
DB293 binding to oligo2-l and 2-2 are, however, dramatically different from results 

20 with the symmetric compounds, and are dramatically different from the results 

obtained with oligol and DB293. Scatchard plots for binding of DB293 and DB270 
are set forth in FIG. 3. 

As in footprinting experiments with AT sites (FIG. 2), DB270 and DB293 
bind in a very similar manner to oligol with linear Scatchard plots indicating one type 

25 of strong binding site that binds a single molecule of DB270 or DB293 with binding 
constants of 2.3-2. 6xl0 7 . Binding of DB270 to oligo2-l is at least a factor often 
weaker than its binding to oligol and probably represents its interaction at the TAT 
sequence in the oligomer that is too short to form a very strong minor-groove 
complex. As shown in FIG. 3, however, binding of DB293 with oligo2-l is highly 

30 cooperative and saturates at two molecules of DB293 per oligo2-l hairpin duplex. 

Fitting of the binding results to a two site model to determine the macroscopic binding 
constants gave a binding constant (Ki) of 2.8x1 0 6 for initial binding and a K 2 of 
7.3x1 0 7 for binding to the second site after the first site is filled. Very similar results 
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are obtained for binding of DB293 to oligo2-2. The similarity of binding constants 
for DB270 and the first molecule of DB293 binding to oligo2-l and 2-2 suggests that 
these are similar processes. The dramatic difference occurs when the second 
molecule of DB293 binds cooperatively with a K 2 that is over 25 times larger than for 
5 binding of DB270 and the first molecule of DB293 (Kj) to the oligomers. These 
results strongly suggest that the unusual footprinting pattern observed with DB293 is 
due to formation of a highly cooperative 2:1 complex in specific DNA sequences. 
The close analogs, furamidine and DB270, do not bind strongly or footprint in these 
DNA sequences. Since all three furan compounds are dications, it is clearly structure, 

1 0 and not charge, that prevents the symmetric derivatives from forming the dimer 
complex. 

EXAMPLE 5 
Structural Studies of Furan Derivatives 

1 5 Structural studies of a number of furan derivatives with oligomers containing 

the AATT sequence of oligol, including X-ray structures of furamidine and alkyl 
derivatives, have clearly demonstrated a 1:1 classical minor-groove binding complex 
in which the amidine groups interact with the edges of A and T bases at the floor of 
the groove in the AATT site. See C. A. Laughton, Biochemistry 35, 5655 (1996) and 

20 S. Neidle, Biopolymers 44, 105 (1997). This is the type of complex expected from the 
experimental results of the furans of FIG. 1 with oligol. In order to characterize the 
2:1 complex of DB293, NMR studies of the DB293-oligo2-l complex were initiated. 
All NMR spectra were acquired with a Varian Unity Plus 600 MHz spectrometer. 
Typical conditions for the collection of spectra in D 2 0: 2 s relaxation delay, 0.6 mL 

25 sample in a 5 mm NMR tube, and 1.0 Hz line broadening before Fourier 

transformation. Two-dimensional experiments were obtained with a spectral width of 
6000 Hz in both dimensions with 2048 complex data points in the t2 dimension and 
512 points in the tl dimension, while ID spectra were collected with a spectral width 
of 6000 Hz and 32K data points. 

30 In proton NMR titrations of the oligomer duplex with DB293 only two DNA 

species are detected at a 1:1 molar ratio. The two species are clearly illustrated with 

2D COSY spectra in FIG. 4 for the aromatic to thymine methyl NMR spectral region. 

The free DNA has six well resolved TH6-TCH3 cross peaks as expected for the six T 

residues in the oligomer. In the 1 : 1 ratio complex there are 12 cross peaks as 

35 expected for two species in slow exchange, and one species has the same chemical 
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shifts as the free DNA. At the 2:1 ratio the free DNA signals disappear and the 

signals for the 2:1 complex double in intensity. The 2:1 complex and the free DNA 

are the only species observed in the 1 : 1 ratio COSY spectrum (FIG. 4) in agreement 

with the high cooperativity observed in the binding experiments. No intermediate 

5 signals for a 1 :1 complex can be detected in any experiments throughout the titration 

of oligo2-l with DB293, and the two species that we observe at the 1:1 ratio are free 

oligomer and the 2:1 complex. In 2D NOESY analysis strong signals are obtained for 

the C H5-H6 interactions, and again only signals for free DNA and the 2:1 complex 

are detected (not shown). In contrast, two sets of cross peaks are detected for DB293 

10 in the oligo2-l complex as expected for two distinct bound molecules in slow 

exchange. Cross peaks between the two DB293 molecules and from DB293 to DNA 

minor-groove protons clearly show that the compound binds in the minor groove as 

an antiparallel dimer and makes contact with both DNA strands. Strong crosspeaks 

from the two bound DB293 molecules to DNA base pairs from T4»A15 to C7-G12 

1 5 are observed and these interactions place the dimer in the ATGA sequence that is 

common to both oligo2-l and 2-2. 

From these results it is clear that all three furan derivatives of FIG. 1 bind to 

the AATT sequence in oligol as classical minor-groove monomer complexes. The 

symmetric compounds such as DB270 and furamidine do not form the dimer species 

20 in a DNA complex and, therefore, do not bind to DNA sequences that do not have 

classical AT minor-groove binding sites. DB293 forms an antiparallel, stacked dimer 

in complex with DNA sites that contain ATGA and probably other sequences. The 

dimer complex provides a new motif for understanding and design of compounds that 

can recognize DNA sequences containing both AT and GC base pairs. The results 

25 presented herein show that the binding of aromatic dications to mixed DNA 

sequences is exquisitely sensitive to compound structure and DNA sequence. 

Although Applicants do not wish to be bound by any theory of the invention, it 

appears that the reasons for the cooperative formation of the DB293 dimer complex 

are encoded in the interactions between a specific DNA sequence and the orientation 

30 of chemical groups in the dimer. Favorable stacking of DB293 to give a dimer in the 

context of the anionic DNA minor groove can also contribute to the 2:1 complex. 

The foregoing is illustrative of the present invention and is not to be construed 

as limiting thereof. The invention is defined by the following claims, with 

equivalents of the claims to be included therein. 
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THAT WHICH IS CLAIMED IS: 

1 . A compound of Formula I: 




R4 

wherein: 

X is selected from the group consisting of O, S, and NH; 

YisCHorN; 

A is CH or N; 

B is selected from the group consisting of NH, O or S; 

Ri is selected from the group consisting of H, loweralkyl, halogen, oxyalkyl, 
oxyaryl, and oxyarylakyl; 

R 2 and R 9 are each independently selected from the group consisting of H, H 2 , 
hydroxy, lower alkyl, cycloalkyl, aryl, alkylaryl alkoxyalkyl, hydroxycycloalkyl, 
alkoxycycloalkoxy, hydroxyalkyl, aminoalkyl and alkylaminoalkyl; and 

R 3 , R4, R13 and R14 are each independently selected from the group consisting of 
H, lower alkyl, alkoxyalkyl, cycloalkyl aryl, alkylaryl, hydroxyalkyl, aminoalkyl, and 
alkylaminoalkyl, or R 3 and R4 together or R13 and Ru together represent a C 2 to C ]0 
alkyl, hydroxyalkyl or alkylene, or R 3 and R4 together or Ro and R M together are: 




wherein n is a number from 1 to 3, and R ]0 is H or -CONHRnNRi 5 Ri 6 , wherein 
Rn is lower alkyl and R !5 and R 16 are each independently selected from the 
group consisting of H and lower alkyl; 
L is selected from the group consisting of: 
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H 



wherein R 5 , R$, R7, and R 8 are each individually selected from the group 
consisting of H, alkyl, halo, aryl, arylalkyl, aminoalkyl, aminoaryl, oxoalkyl, 
oxoaryl, and oxoarylalkyl; and wherein said compound of Formula I binds the 
minor groove of DNA as a dimer. 

2. The compound of Formula I, wherein L is: 
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A is N, B is NH ; X is O, Y is CH, R,, R 2 , R4, R5, R*. R7. R*. R9 and R 14 are each H, and 
R3 and R13 are each H 2 . 



3. 



The compound of Formula I, wherein L is: 



— N 




H 

A is N, B is NH, X is O, Y is CH, R,, R 2 , R4, R5, R*, R7. R*. R9 and R, 4 are each H 3 and 
R 3 and R 13 are each H 2 . 

4. A method of binding mixed sequence DNA comprising contacting a 
sample DNA with a compound of Formula (I): 



X is selected from the group consisting of 0, S, and NH; 
Y is CH orN; 
A is CH or N; 

B is selected from the group consisting of NH, O or S; 

Rj is selected from the group consisting of H, loweralkyl, halogen, oxyalkyl, 
oxyaryl, and oxyarylakyl; 

R 2 and R 9 are each independently selected from the group consisting of H, H 2 , 
hydroxy, lower alkyl, cycloalkyl, aryl, alkylaryl, alkoxyalkyl, hydroxycycloalkyl, 
alkoxycycloalkoxy, hydroxyalkyl, aminoalkyl and alkylaminoalkyl; and 

R3, R4, R13 and R ]4 arc each independently selected from the group consisting of 
H, lower alkyl, alkoxyalkyl, cycloalkyl, aryl, alkylaryl, hydroxyalkyl, aminoalkyl, and 
alkylaminoalkyl, or R_* and R4 together or R13 and R ]4 together represent a C 2 to C l0 
alkyl, hydroxyalkyl, or alkylene, or R 3 and R4 together or R13 and R 14 together are: 




wherein: 
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wherein n is a number from 1 to 3, and Rio is H or -CONHRi jNR} 5 R 16 , wherein 
Rn is lower alkyl and R| 5 and R| 6 are each independently selected from the 
group consisting of H and lower alkyl; 
L is selected from the group consisting of: 
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wherein R 5 , R 6 , R7, and R 8 are each individually selected from the group consisting of 
H, alkyl, halo, aryl, arylalkyl, aminoalkyl, aminoaryl, oxoalkyl, oxoaryl, and 
oxoarylalkyl; wherein said compound of Formula I binds the minor groove of DNA as 
a dimer. 

5. The method of Claim 4 wherein L is: 



A is N, B is NH, X is O, Y is CH, Ri, R 2 , R*, R5, R6, R7. R8.R9 and R 14 are each H, and 
R 3 and R13 are each H 2 . 

6. The method of Claim 4, wherein L is: 



H 

A is N, B is NH, X is O, Y is CH, R, f R 2 , R4, R& R6, R?, Rs,R9 and R M are each H, and 
R 3 and R13 are each H 2 . 

7. A method of detecting mixed sequence DNA comprising contacting a sample of 
DNA with a fluorescent compound of Formula (i): 






R4 



wherein: 



X is selected from the group consisting of O, S, and NH; 
Y is CH or N; 



A is CH or N; 
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B is selected from the group consisting of NH, O or S; 

Ri is selected from the group consisting of H, loweralkyl, halogen, oxyalkyl, 
oxyaryl, and oxyarylakyl; 

R 2 and Rq are each independently selected from the group consisting of H, H 2 , 
hydroxy, lower alkyl, cycloalkyl, aryl, alkylaryl, alkoxyalkyl, hydroxycycloalkyl, 
alkoxycycloalkoxy, hydroxyalkyl, aminoalkyl and alkylaminoalkyl; and 

R 3 , R4, R]3 and R14 are each independently selected from the group consisting of 
H, lower alkyl, alkoxyalkyl, cycloalkyl, aryl, alkylaryl, hydroxyalkyl, aminoalkyl, and 
alkylaminoalkyl, or R 3 and R4 together or R13 and Rj 4 together represent a C 2 to C10 
alkyl, hydroxyalkyl, or alkylene, or R 3 and R4 together or R13 and R| 4 together are: 



wherein n is a number from 1 to 3, and Rio is H or -CONHR11NR15R16, wherein 
Rn is lower alkyl and R15 and Rje are each independently selected from the 
group consisting of H and lower alkyl; 
L is selected from the group consisting of: 
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wherein R 5 , R 6 , R7, and R 8 are each individually selected from the group 
consisting of H, alkyl, halo, aryl, arylalkyl, aminoalkyl, aminoaryl, oxoalkyl, 
oxoaryl, and oxoarylalkyl; and wherein said compound of Formula I binds the 
minor groove of DNA as a dimer; 

and then observing fluorescence in the sample, the observation of fluorescence 
indicating the compound of Formula I has bound to a sequence of DNA. 

8. The method of Claim 7, wherein L is: 
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A is N, B is NH, X is O, Y is CH, Ri, R 2 , R4, R5, R«, R?, R«, R9 and R 14 are each H, and 
R 3 and R I3 are each H 2 . 

9. The method of Claim 7, wherein L is: 



H 

A is N, B is NH, X is O, Y is CH, R h R 2 , R4, R5, R*, R7. Rg, R9 and R 14 are each H, and 
R 3 and Rn are each H 2 . 

10. A pharmaceutical formulation comprising a compound of Formula T: 



X is selected from the group consisting of O, S ? and NH; 
Yis CH orN; 
A is CH or N; 

B is selected from the group consisting of NH ? O or S; 

Ri is selected from the group consisting of H, loweralkyl, halogen, oxyalkyl, 
oxyaryl, and oxyarylakyl; 

R 2 and R9 are each independently selected from the group consisting of H, H 2 , 
hydroxy, lower alkyl, cycloalkyl, aryl, alkylaryl, alkoxyalkyl, hydroxycycloalkyl, 
alkoxycycloalkoxy, hydroxyalkyl, aminoalkyl and alkylaminoalkyl; and 





wherein: 
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R3, R4, R13 and R14 are each independently selected from the group consisting of 
H, lower alkyl, alkoxyalkyl, cycloalkyl, aryl, alkylaryl, hydroxyalkyl, aminoalkyl, and 
alkylaminoalkyl. or R 3 and R4 together or R13 and R14 together represent a C2 to C10 
alkyl, hydroxyalkyl, or alkylene, or R 3 and R4 together or R| 3 and R )4 together are: 




wherein n is a number from 1 to 3, and Rio is H or -CONHR11NR15R16, wherein 
Rn is lower alkyl and R15 and Rj 6 are each independently selected from the 
group consisting of H and lower alkyl; 
L is selected from the group consisting of: 




H 
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wherein R 5 , R$, R 7 , and R 8 are each individually selected from the group consisting of 
H, alkyl, halo, aryl, arylalkyl, aminoalkyl, aminoaryl, oxoalkyl, oxoaryl, and 
oxoarylalkyl; 

in a pharmaceutically acceptable carrier. 

1 1 . The pharmaceutical formulation of Claim 1 0, wherein L is: 



A is N, B is NH, X is O, Y is CH, R h R 2 , R4, R5, R*, R7, R*. R9 and R, 4 are each H, and 
R 3 and R]3 are each H 2 . 

1 2. The pharmaceutical formulation of Claim 1 0, wherein Lis: 



H 

A is N, B is NH, X is O, Y is CH, R !? R 2 , R4, R 5 , R<>, R? } R», R9 and R, 4 are each H, and 
R3 and R}3 are each H 2 . 
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